Controlled Language and Knowledge-based Machine Translation: Principles and Practice 1
نویسندگان
چکیده
For applications with certain properties (well-deened technical domain, technical vocabulary, simple grammatical style), use of a controlled language can enhance the accuracy of knowledge-based machine translation (Baker, et al., 1994). In the continuing development of the KANT system (Mitamura, et al., 1991; Mitamura and Nyberg, 1995), we have explored diierent sublanguage techniques which limit the complexity of natural language analysis, thus increasing the likelihood of an accurate translation. In this presentation, we will describe some of the general techniques that have been developed for sublan-guage use within the KANT system. 1 Introduction In recent years there has been increasing interest in the use of controlled source languages in MT (cf. Adriaens and Schreurs, 1992 and the references cited there). In this paper, we focus on the use of controlled input language in the KANT translation system (Mitamura, et al., 1991). Controlled English is used to improve the clarity of expression in the source text, and to improve the quality of the translation output. KANT is a knowledge-based, interlingual machine translation system, a descendant of the KBMT-89 system (Goodman and Nirenburg, eds., 1991). KANT uses explicit source language lexicons, grammars and domain semantics to produce an interlingua representation (IR) for each sentence. Each IR is a semantic frame containing features and semantic roles, which may be lled by other IR frames. If a source sentence has more than one possible analysis, KANT produces more than one IR structure. The input to the target language generation module is the set of IRs produced for a source sentence. The decoupling of the analysis and generation phases has
منابع مشابه
Controlled Language and Knowledge-Based Machine Translation: Principles and Practice
For applications with certain properties (well-defined technical domain, technical vocabulary, simple grammatical style), use of a controlled language can enhance the accuracy of knowledge-based machine translation (Baker, et al., 1994). In the continuing development of the KANT system (Mitamura, et al., 1991; Mitamura and Nyberg, 1995), we have explored different sublanguage techniques which l...
متن کاملAssessing the Quality of Persian Translation of the Book “Principles of Marketing” Based on the House’s (TQA) Model
Translation is evaluated in terms of its forms and functions inside the historically developed systems of the receiving culture and literature. This study aimed to evaluate the quality of Persian translation of the14th edition of the original English book “Principles of Marketing” written by Philip Kotler and Gary Armstrong based on House (TQA) model: overt and covert translation distinction. T...
متن کاملControlled English for Knowledge - Based MT : Experience with the KANT System 1
In this paper, we describe the design and deployment of KANT Controlled English (KCE) for knowledge-based machine translation in the KANT system. KCE combines three kinds of constraints: constraints on the lexicon; constraints on the complexity of sentences; and the use of generalized markup language. We describe how each of these types of language control are utilized in the implementation of ...
متن کاملA new model for persian multi-part words edition based on statistical machine translation
Multi-part words in English language are hyphenated and hyphen is used to separate different parts. Persian language consists of multi-part words as well. Based on Persian morphology, half-space character is needed to separate parts of multi-part words where in many cases people incorrectly use space character instead of half-space character. This common incorrectly use of space leads to some s...
متن کاملControlled English for Knowledge-Based MT: Experience with the KANT System
In this paper, we describe the design and deployment of KANT Controlled English (KCE) for knowledge-based machine translation in the KANT system. KCE combines three kinds of constraints: constraints on the lexicon; constraints on the complexity of sentences; and the use of generalized markup language. We describe how each of these types of language control are utilized in the implementation of ...
متن کامل